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DETAILED ACTION 



Response to Amendment 

Applicants' arguments filed on 1 1/24/04. Claims 32-37 are newly added. Claims 
1-37 are presented for examinations. 

Drawings 

The drawings were received on 1 1/24/04. These drawings are accepted. 

Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 1 02 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary sl^ill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

This application currently names joint inventors. In considering patentability of 

the claims under 35 U.S.C. 103(a), the examiner presumes that the subject matter of 

the various claims was commonly owned at the time any inventions covered therein 

were made absent any evidence to the contrary. Applicant is advised of the obligation 

under 37 CFR 1 .56 to point out the inventor and invention dates of each claim that was 



not commonly owned at the time a later invention was made in order for the examiner to 
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consider the applicability of 35 U.S.C. 103(c) and potential 35 U.S.C. 102(e). (f) or (g) 
prior art under 35 U.S.C. 1 03(a). 

Claims 1, 6-19, 28-34 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Steinkraus (US Patent 6,363,373 B1 ) in view of Bond et al (US Patent 6,539,348 
81). 

As per claim 1, Steinkraus discloses the recited limitations as follows: 
'A computer-implemented method of generating concept units from user search 
queries' as the search queries input by the user wherein the search terms are broken 

I 

into word to/cens [See col. 2, lines 40-45] and the search engine uses the word tokens ' 
to generate a list of concept tokens [See col. 2, line 34]. The list of content tokens is 
used to search for concepts [See col. 2, line 58-59], the method comprising: 

'receiving a plurality of queries' as preprocessing search queries input by the 
users [See col. 7, line 30 and line 36], 'each query comprising a string of one or more 
words' as the query contains words, word-like strings, numbers [See col. 7, lines 38-39]; 

'tokenizing each query string to produce one or more tokens for each query' as 
the query is read and broken into word tokens [See abstract, lines 13-14], 'wherein said 
tokens for said queries form an initial set of units' as the queries are broken and 
produced a series of word tokens [See col. 5, lines 35-36]; 

'until a convergence condition is satisfied' as the concept tokens weights are less 
than a threshold value [See col. 8, lines 8-10]. 
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Steinkraus does not explicitly teach combining units from the initial set of units 
that appear adjacent each other in a query to form a second set of units, validating the 
second set of units, repeating the steps of combining and validating one or more times 
using the second set of units in place of the initial set of units, wherein a final set of units 
is formed once the convergence condition has been satisfied and storing the final set of 
units to a memory. 

However, Bond teaches, an analogous system for tokenizing words of a natural 
language [See abstract, lines 2-3]. In particular. Bond teaches: 

'combining units from the initial set of units that appear adjacent each other in a 
query to form a second set of units' as a token merging program for merging the 
sequence of a phrase or clause [See col. 5, line 51 , line 35, col. 1 , line 26], the 
processor iterates matching consecutive tokens against the first set of rule [See Fig. 1 , 
# 26, col. 5, lines 26-27]. The processor will continues to iterate through the tokenized 
of words until further word tokens can be made by processor [See col. 5, lines 27-31]. 
Bond explained as the processor performs matching consecutive tokens in the resulting 
narrower set of possible syntactic interpretations against a second set of rules [See col. 
5, lines 32-33]; 

'validating the second set of units' as the processor again, processes the second 
set of rules until no further narrowing of possible syntactic interpretations [See col. 5, 
lines 36-37]; 

'repeating the steps of combining and validating one or more times using the 
second set of units in place of the initial set of units' as the process proceeds into 
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reiteration program code [See col. 5, lines 38-40], 'wherein a final set of units is formed 
once the convergence condition has been satisfied' as tlie completion of the iterate 
inductive processing output tlie syntactic possible interpretations marked text [See Fig. 1 . 
# 38, col. 5, lines 42-44]; and 

'storing the final set of units to a memory' as the syntactically marked text is 
output and retained in a software object to permit further analysis by the users [See col. 
6, lines 13-17]. 

Thus, it would have been obvious to one of ordinary skill in the art at the time the 
invention was made to combine the teachings of the cited references to implement the 
steps of combining units, validating all units that adjacent each other, and repeating the 
steps of combining until a convergence condition is satisfied, then storing the final set of 
units to memory. Bond's teachings would allow users of Steinkraus's system to readily 
determine the appropriate syntax for a natural language query to obtain meaningful 
interpretations and analyses of enormous amount of information to conveniently search 
for the concepts associated with any given user's needs as suggested by Bond [See 
col. 1 , lines 29-40. col. 2, line 62-63]. 

As per claim 6, Bond teaches 'including generating unit extensions using the final 
set of units' as the completion of syntactic possibilities interpretations are passed on to 
deductive token merging code. The deductive code reviews possible interpretations 
and determines which ones are remaining for the inductive merging processes [See col. 
5, lines 59-64]. 
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As per claim 7, Bond teaclies 'wherein generating unit extensions includes 
identifying units that are subsets of other units' as all words in a sentence are tokenized 
into multiple words [See col. 4, lines 20-21]. Because Bond teaches that each of word 
in a sentence is a 'word token\ which is equivalent to the claim limitation "unit". Bond 
disclosed that an article: ''United States"[See coL 4Jine 19] is tokenized into word 
tokens, ''United" and ''States". Therefore, the "United" is a subset of other "unit", 
wherein the other unit is a "States", 

As per claim 8, Bond teaches 'storing the unit extensions to the memory' as the 
syntactically marked text is output and retained in a software object to permit further 
analysis [See col. 6, lines 13-15]. 

As per claim 9, Bond teaches 'generating unit associations using the final set of 
units' as a listing of words and their associated part of speech is analyzed [See col. 3, 
lines 32-40] bypassing the syntactic possibilities (the final set of units) on to deductive 
token merging code. The deductive code reviews possible interpretations and 
determines which ones are remaining for the inductive merging processes [See col. 5, 
lines 59-64]. 

As per claim 10, Bond teaches 'wherein generating unit associations includes 
identifying units that are associated with other units' a listing of words and their 
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associated part of speech is analyzed [See col. 3, lines 32-40]. Bond teaches that each 
of word in a sentence is a 'word token\ which is equivalent to the claim limitation "unit". 
Bond disclosed that an article: "United States" [See col. 4,line 19] is tokenized into word 
tokens, ''United" and ''States". Therefore, the "United" is a subset of other "unit", 
wherein the other unit is a "States". 

As per claim 1 1 , Bond teaches 'storing the unit associations to the memory' as 
the syntactically marked text is output and retained in a software object to permit further 
analysis [See col. 6, lines 13-15]. 

As per claim 12, Bond teaches 'wherein identifying associated units includes 
determining which units appear in queries with other units' as a determination of which 
syntactic identifier is the most commonly used for a given word [See col. 6, lines 3-5]. 

As per claim 13, Bond teaches 'generating unit alternatives after the 
convergence condition has been satisfied' as a syntactic sequence that fits the definition 
of a clause or consecutive tokens [See col. 5, lines 32-35]. It should be noted that the 
instant specifications describes a "unit alternatives" as a "sequence of words" [See 
Applicant's specification page 13, line 25]. 

As per claim 14, Bond teaches 'wherein generating unit alternatives includes 
determining whether an edit distance between two units in the final set of units is 
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smaller than a threshold value' as the possible syntactic interpretations of the word 
tokens of the sentence may be deleted or replaced by identifiers covering a smaller 
class of words [See col. 2, lines 7-9], 'and if so, comparing the relative frequencies of 
the two units' as the series of tokens are used consecutively and compared with a first 
list of rules in order to produce a narrower set of possible syntactic interpretations [See 
col. 4, lines 57-58, col. 2, lines 5-7]. 

As per claim 15, Bond teaches: 

'generating unit extensions using the final set of units' as the syntactic 
possibilities are passed on to deductive token merging code. The deductive code 
reviews possible interpretations and determines which ones are remaining for the 
inductive merging processes [See col. 5, lines 59-64]; 

'generating unit associations using the final set of units' as a listing of words and 
their associated part of speech is analyzed [See col. 3, lines 32-40] by passing the 
syntactic possibilities (the final set of units) on to deductive token merging code. The 
deductive code reviews possible interpretations and determines which ones are 
remaining for the inductive merging processes [See col. 5, lines 59-64]; and 

'generating unit alternatives using the final set of units' as a syntactic sequence 
that fits the definition of a clause or consecutive tokens [See col. 5, lines 32-35]. It 
should be noted that the instant specifications describes a "unit alternatives" as a 
"sequence of words" [See Applicant's specification page 13, line 25]. 
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As per claim 16, Bond teaches 'storing the unit extensions, the unit associations 
and the unit alternatives to the memory' as the syntactically marked text is output and 
retained in a software object to permit further analysis by the users [See col. 6, lines 
13-17]. 

As per claim 17, Bond teaches: 

'wherein generating unit extensions includes identifying units that are subsets of 
other units' as all words in a sentence are tokenized into multiple words [See col. 4, 
lines 20-21]. Because Bond teaches that each of word in a sentence is a 'word token', 
which is equivalent to the claim limitation "unit". Bond disclosed that an article: ''United 
States" [See col. 4,line 19] is tokenized into word tokens, "United" and "States" 
Therefore, the "United" is a subset of other "unit", wherein the other unit is a ''States", 

Wherein generating unit associations includes identifying units that are 
associated with other units' a listing of words and their associated part of speech is 
analyzed [See col. 3, lines 32-40]. Bond teaches that each of word in a sentence is a 
'word token\ which is equivalent to the claim limitation "unit". Bond disclosed that an 
article: "United Sfates"[See col. 4,line 19] is tokenized into word tokens, "United" and 
"States". Therefore, the "United" is a subset of other "unit", wherein the other unit is a 
"States", and 

'wherein generating unit alternatives includes determining whether an edit 
distance between two units in the final set of units is smaller than a threshold value' as 
possible syntactic interpretations of the word tokens of the sentence may be deleted or 
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replaced by identifiers covering a smaller class of words [See col. 2. lines 7-9], 'and if 
so, comparing the relative frequencies of the two units' as the series of tokens are used 
cor)secutively and compared with a first list of rules in order to produce a narrower set of 
possible syntactic interpretations [See col. 4, lines 57-58, col. 2, lines 5-7]. 

As per claim 18, Bond teaches 'wherein validating includes for each combined 
unit in the second set of units' as the processor performs matching consecutive tokens 
in the resulting narrower set of possible syntactic interpretations against a second set of 
rules through the inductive merging program [See col. 5, lines 31-33, 51], 'comparing a 
frequency of occurrence of the combined unit with a frequency of occurrence of each 
constituent unit in the combined unit' as the series of tokens are used consecutively and 
compared with a first list of rules in order to produce a narrower set of possible syntactic 
interpretations [See col. 4, lines 57-58, col. 2, lines 5-7]. 

As per claim 19, Bond teaches 'wherein the convergence condition includes a 
threshold value, wherein the convergence condition is satisfied if a change in the 
number of units in the two second set of units between successive steps of combining 
and validating is smaller than or equal to the threshold value' as possible syntactic 
interpretations of the word tokens of the sentence may be deleted or replaced by 
identifiers covering a smaller class of words [See col. 2, lines 7-9]. 

As per claim 28, Steinkraus teaches the recited limitations as follows: 
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'a computer readable medium including code for causing a processor to generate 
concept units from a plurality of user search queries' as the search queries input by the 
user wherein the search terms are broken into word tokens [See col. 2, lines 40-45] and 
the search engine uses the word tokens to generate a list of concept tokens [See col. 2, 
line 34]. The list of content tokens is used to search for concepts [See col. 2, line 58- 
59], 'each query comprising a string of one or more words' as the query contains words, 
word-like strings, numbers [See col. 7, lines 38-39], 'wherein the code includes 
instructions to: 

a) tokenize each query string to produce one or more tokens for each query' as 
the query is read and broken into word tokens [See abstract, lines 1 3-14], 'wherein said 
tokens for said queries form an initial set of units' as the queries are broken and 
produced a series of word tokens [See col. 5, lines 35-36]; 

'until a convergence condition is satisfied' as the concept tokens weights are less 
than a threshold value [See col. 8, lines 8-1 0]. 

Steinkraus does not explicitly teach b) combine units from the initial set of units 
that appear adjacent each other in a query to form a second set of units, c) validate the 
second set of units, d) repeat b) and c) one or more times using the second set of units 
in place of the initial set of units, wherein a final set of units is formed once the 
convergence condition has been satisfied and store the final set of units to a memory 
module. 

However, Bond teaches, an analogous system for tokenizing the words of the 
natural language [See abstract, lines 2-3]. In particular, Bond teaches: 
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'b) combine units from the Initial set of units that appear adjacent each other in a 
query to form a second set of units' as a token merging program for merging the 
sequence of a phrase or clause [See col. 5, line 51 , line 35, col. 1 , line 26], the 
processor iterates matching consecutive tokens against the first set of rule [See Fig. 1 , 
# 26, col. 5, lines 26-27]. The processor will continues to iterate through the tokenized 
of words until further word tokens can be made by processor [See col. 5, lines 27-31]. 
Bond explained as the processor performs matching consecutive tokens in the resulting 
narrower set of possible syntactic interpretations against a second set of rules [See col. 
5. lines 32-33]; 

'c) validate the second set of units' the processor again, processes the second 
set of rules until no further narrowing of possible syntactic interpretations [See col. 5, 
lines 36-37]; 

'd) repeat b) and c) one or more times using the second set of units in place of 
the initial set of units' as the process proceeds into reiteration program code [See col. 5, 
lines 38-40], 'wherein a final set of units is formed once the convergence condition has 
been satisfied' as the completion of the iterative inductive processing output the 
syntactic possible interpretations marked text [See Fig. 1, # 38, col. 5, lines 42-44]; and 

'store the final set of units to a memory module' as the syntactically marked text 
is output and retained in a software object to permit further analysis by the users [See 
col. 6, lines 13-17]. 

Thus, it would have been obvious to one of ordinary skill in the art at the time the 
invention was made to combine the teachings of the cited references to implement the 
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steps of combining units, validating all units that adjacent each other, and repeating the 
steps of combining until a convergence condition is satisfied, then storing the final set of 
units to memory. Bond's teachings would allow users of Steinkraus's system to readily 
determine the appropriate syntax for a natural language query to obtain meaningful 
interpretations and analyses of enormous amount of information to conveniently search 
for the concepts associated with any given user's needs as suggested by Bond [See 
col. 1, lines 29-40, col. 2, line 62-63]. 

As per claim 29, Bond teaches 

'wherein the code further includes instructions to: generate unit extensions using 
the final set of units' as the completion of syntactic possibilities interpretations are 
passed on to deductive token merging code. The deductive code reviews possible 
interpretations and determines which ones are remaining for the inductive merging 
processes [See col. 5, lines 59-64]; 

'generate unit associations using the final set of units' as a listing of words and 
their associated part of speech is analyzed [See col. 3, lines 32-40] by passing the 
syntactic possibilities (the final set of units) on to deductive token merging code. The 
deductive code reviews possible interpretations and determines which ones are 
remaining for the inductive merging processes [See col. 5, lines 59-64]; 

'generate unit alternatives using the final set of units' as a syntactic sequence 
that fits the definition of a clause or consecutive tokens [See col. 5, lines 32-35]. It 
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should be noted that unit alternatives is described in the specification is a "sequence of 
words" [See Applicant's specification page 13, line 25]; and 

'store the unit extensions, unit associations and unit alternatives to the memory 
module in association with the final set of units' the syntactically marked text is output 
and retained in a software object to permit further analysis by the users [See col. 6, 
lines 13-17]. 

As per claim 30, Bond teaches: 

'wherein the instructions to generate unit extensions includes instructions to 
identify units that are subsets of other units' as all words in a sentence are tokenized 
into multiple words [See col. 4, lines 20-21]. Because Bond teaches that each of word 
in a sentence is a 'word token', which is equivalent to the claim limitation "unit". Bond 
disclosed that an article: "United States" [See col. 4,line 19] is tokenized Into word 
tokens, "United" and "States". Therefore, the "United" is a subset of other "unit", 
wherein the other unit is a "States", 

Wherein the instructions to generate unit associations includes instructions to 
identify units that are associated with other units' a listing of words and their associated 
part of speech is analyzed [See col. 3, lines 32-40]. Bond teaches that each of word in a 
sentence is a 'word token', which is equivalent to the claim limitation "unit". Bond 
disclosed that an article: "United States" [See col. 4,line 1 9] Is tokenized into word 
tokens, "United" and "States". Therefore, the "United" is a subset of other "unit", 
wherein the other unit is a "States", and 



Application/Control Number: 10/713,576 Page 15 

Art Unit: 2167 

'wherein the instructions to generate unit alternatives includes instructions to 
determine whether an edit distance between two units in the final set of units is smaller 
than a threshold value' as possible syntactic interpretations oftlie word tokens of the 
sentence may be deleted or replaced by identifiers covering a smaller class of words 
[See col. 2, lines 7-9], 'and if so, compare the relative frequencies of the two units' as 
the series of tokens are used consecutively and compared with a first list of rules in 
order to produce a narrower set of possible syntactic interpretations [See col. 4, lines 
57-58, col. 2, lines 5-7]. 

As per claim 31 , Steinkraus teaches 'wherein each word comprises one or a 
plurality of alphanumeric characters' as word tokens maybe words, word-like strings, 
numbers, eta [See col. 7, lines 38-39]. 

As per claim 32, Bond teaches 'wherein storing the final set of units to a memory 
comprising storing the final set of units to a unit dictionary' as the word can be marked 
as an unknown, then the dictionary can be continually updated to include results 
established for unknown through the analyzed process (col. 3, lines 52-61 ). 



As per claim 33, Bond teaches 'wherein the memory unit comprises a unit 
dictionary, and wherein storing a final set of units to the memory unit comprises storing 
the final set of units to the unit dictionary' as the word can be marked as an unknown. 
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then the dictionary can be continually updated to include results established for 
unknown through the analyzed process (col. 3, lines 52-61). 

As per claim 34, Bond teaches 'wherein storing the final set of units to a memory 
module comprises storing the final set of units to a unit dictionary' as the word can be 
marked as an unknown, then the dictionary can be continually updated to include 
results established for unknown through the analyzed process (col. 3, lines 52-61 ). 

Claims 2-5, 20-27 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Stelnkraus (US Patent 6,363,373 B1 ) in view of Bond et a! (US Patent 6,539,348 
B1) and further in view of Whitman et al (US Patent 6,772,150 B1). 

As per claim 2, Steinkraus and Bond do not teach wherein receiving includes 
receiving one or more query log files, each query log file including a plurality of queries. 
However, Whitman teaches 'receiving one or more query log files' as a web server, 
maintains a daily transaction log file of all the requests it has received from web 
browsers [See Fig. 3, col. 9, lines 1-3], 'each query log file including a plurality of 
queries' as the daily transaction log file 135 is made up several entries, each entry 
containing information about a different request [See Fig. 3,col. 9, lines 3-5]. Thus, it 
would have been obvious to one of ordinary skill in the art at the time the invention was 
made, to combine the teachings of the cited references, because Whitman's teaching of 
daily transaction log file that received all of the requests from web browsers would allow 
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users of Steinkraus and Bond's system to utilize the transaction log file as a references 
or as a look-up table, to select the previous related search phrases to be suggested to 
the user when the user submits a search query which has a similar context of search 
terms. The related query terms suggests to the user as a part of the search refinement 
process. The goal of this process is to produce a refined search query that more 
narrowly specifies the user's intended requests, as explained in Whitman [See col. 1, 
lines 54-58, col. 2, line 51-53, col. 12, lines 49-50]. 

As per claim 3, Steinkraus and Bond do not teach consolidating the plurality of 
queries from the one or more query log files into a single consolidated query file. 
However, Whitman teaches 'consolidating the plurality of queries from the one or more 
query log files into a single consolidated query file' as combining new data with existing 
data from earlier constituent time periods to form a collective search phrase table (log 
file) [See Fig. 4, #440, 450. col. 8, lines 38-40, 49-51]. Thus, it would have been 
obvious to one of ordinary skill in the art at the time the invention was made to combine 
the teachings of the cited references because merging or combining of transaction log 
files as disclosed by Whitman's system would allow users of Steinkraus and Bond to 
eliminate the duplicate phrases [See col. 9, lines 48-53, col. 10, lines 45-46, col. 13, 
lines 30-35] so that the selection process selects related search phrases to be 
suggested to the user is invoked only for single term queries [See col. 12, line 64]. An 
important benefit of this method is that it is highly efficient, allowing the query result 
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page to be returned without adding appreciable delay, as suggested in Whitman [See 
col. 12, lines 51-65]. 

As per claim 4, Whitman teaches 'wherein consolidating includes removing 
duplicates of queries and incrementing a count associated with each individual query 
each time a duplicate of said individual query is removed, wherein the consolidated file 
includes a list of individual queries and counts associated therewith' as the search 
phrases can be done by sorting alphabetically and removing duplicate phrases [See 
col. 13, lines 31-35], \fthe search phrase already exists under the key term pair, the 
process increments the search phrase's score [See col. 10, lines 63-64]. 

As per claim 5, Whitman teaches 'wherein the received query log files include 
query log files for each day of a week' as daily results file for 
7-Feb-98 [See Fig. 6, col. 12, line 8], 'and wherein consolidating includes forming a 
single consolidated query file including queries for the week' as generating a new query 
table for the period of the last seven days 1-Feb-98 to 7-Feb-98 [See Fig. 6, col. 12, 
lines 11-19]. 

As per claim 20, Steinkraus teaches; 

'receiving an individual query from a user' as a search query input by the user 
[See Fig. 3, col. 7, line 36]; 
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'identifying one or more units in the individual query' as the query is read and 
broker) into word tokens [See abstract, lines 13-14]; and 

'unit alternatives' as a syntactic sequence that fits the definition of a clause or 
consecutive tokens [See col. 5, lines 32-35]. It should be noted that the instant 
specifications describes a "unit alternatives" as a "sequence of words" [See Applicant's 
specification page 13, line 25], 

Steinkraus does not explicitly teach using one or more of the unit extensions, unit 
associations stored in the memory In association with the one or more units identified in 
the individual query. However, Bond teaches 'using one or more of the unit extensions' 
as the syntactic possibilities are passed on to deductive token merging code. The 
deductive code reviews possible interpretations and determines which ones are 
remaining for the inductive merging processes [See col. 5, lines 59-64], 'unit 
associations' as a listing of words and their associated part of speech is analyzed [See 
col. 3, lines 32-40] by passing the syntactic possibilities (the final set of units) on to 
deductive token merging code. The deductive code reviews possible interpretations 
and determines which ones are remaining for the inductive merging processes [See col. 
5, lines 59-64] and 'stored in the memory in association with the one or more units 
identified in the individual query' as the syntactically marked text is output and retained 
in a software object to permit further analysis [See col . 6, lines 13-15], Thus, it would 
have been obvious to one of ordinary skill in the art at the time the invention was made 
to combine the teachings of the Steinkraus and Bond's system, because the beneficial 
of Bond's teaching of using the unit extension, unit association stored in the memory in 
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association with units identified in the individual query would offer users of Steinkraus's 
system to directly access to the syntax parse independent without the need for 
customizing of any semantic information such as happens in noun and verb classes, as 
explained in Bond [See col. 6, lines 16-21]. 

Steinkraus and Bond do not explicitly teach determining one or more suggestions 
to provide to the user responsive to the query. However, Whitman teaches 'determining 
one or more suggestions to provide to the user responsive to the query' as when a user 
submits a search query, a query processing component uses the search phrases table 
137 of Figure 1, to look up one or more related search phrases to suggest to the user as 
alternative queries [See col. 2, lines 52-54]. Thus, it would have been obvious to one of 
ordinary skill In the art at the time the invention was made, to combine the teachings of 
the cited references because Whitman's system would allow users of Steinkraus and 
Bond's system to utilize a search phrases table 137 to select the related search phrases 
to be suggested to the user when the user submits a search query which has a similar 
context of search terms. The related query terms suggests to the user as a part of the 
search refinement process. The goal of this process is to produce a refined search 
query that more narrowly specifies the user's intended requests, as explained in 
Whitman [See col. 1, lines 54-58, col. 2, line 51-53, col. 12, lines 49-50], 

As per claim 21 , Steinkraus discloses the recited limitations as follows: 
'a system for generating concept units from user search queries' as the search 
queries input by the user wherein the search terms are broken into word tokens [See 
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col. 2, lines 40-45] and the search engine uses the word tokens to generate a list of 
concept tokens [See col. 2, line 34]. The list of content tokens is used to search for 
concepts [See col. 2. line 58-59], the system comprising: 

'a memory unit' as a memory includes RAM and ROM [See Fig. 1 , # 22, # 24 and 
#25]; and 

'a processing module' as the program modules [See col. 4, line 19], 'configured 
to receive one or more query files' as preprocessing search queries input by the users 
[See col. 7, line 30 and line 36], 'each query including a string of one or more words' as 
the query contains words, word-like strings, numbers [See col. 7, lines 38-39], *and 
wherein the processing module is further configured to: 

tokenize each query from the query files' as the query is read and broken into 
word tokens [See abstract, lines 13-14] 'to produce an initial set of units' as the queries 
are broken and produced a series of word tokens [See col. 5, lines 35-36]; and 

'thereafter, iteratively, until a convergence condition is satisfied' as the concept 
tokens weights are less than a threshold value [See col. 8, lines 8-10]. 

Steinkraus does not explicitly teach combine units from the initial set of units that 
appear adjacent each other in a query to form a second set of units, validate the second 
set of units, wherein the second set of units is used for each iteration, and once the 
convergence condition has been satisfied, store a final set of units to the memory unit. 

However, Bond teaches, an analogous system for tokenizing words of a natural 
language [See abstract, lines 2-3]. In particular, Bond teaches: 
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'combine units from the initial set of units that appear adjacent each other in a 
query to form a second set of units' as a token merging program for merging the 
sequence of a phrase or clause [See col. 5, line 51 , line 35, col. 1 , line 26], the 
processor iterates matching consecutive tokens against the first set of rule [See Fig. 1 , 
# 26, col. 5, lines 26-27]. The processor will continues to iterate through the tokenized 
of words until further word tokens can be made by processor [See col. 5, lines 27-31]. 
Bond explained as the processor performs matching consecutive tokens in the resulting 
narrower set of possible syntactic interpretations against a second set of rules [See col. 
5, lines 32-33]; and 

'validate the second set of units' as the processor again, processes the second 
set of rules until no further narrowing of possible syntactic interpretations [See coL 5, 
lines 36-37], 'wherein the second set of units is used for each iteration' as the process 
proceeds into reiteration program code [See col. 5, lines 38-40]; and 

'once the convergence condition has been satisfied' as the completion of the 
iterative inductive processing output the syntactic possible interpretations marked text 
[See Fig. 1 , # 38, col. 5, lines 42-44], 'store a final set of units to the memory unit' as the 
syntactically marked text is output and retained in a software object to permit further 
analysis by the users [See col. 6, lines 13-17]. 

Thus, it would have been obvious to one of ordinary skill in the art at the time the 
invention was made to combine the teachings of the cited references to implement the 
steps of combining units, validating all units that adjacent each other, and repeating the 
steps of combining until a convergence condition is satisfied, then storing the final set of 
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units to memory. Because Bond*s teachings would allow users of Steinkraus's system 
to readily determine the appropriate syntax for a natural language query to obtain 
meaningful interpretations and analyses of enormous amount of information to 
conveniently search for the concepts associated with any given user's needs as 
suggested by Bond [See col. 1, lines 29-40. col. 2, line 62-63]. 

Steinkraus and Bond do not teach receive one or more query log files, each 
query log file including a plurality of queries. However, Whitman teaches 'receive one 
or more query log files' as a web server, maintains a daily transaction log file ofal the 
requests it has receives from web browsers [See Fig. 3, col. 9, lines 1-3], 'each query 
log file including a plurality of queries' as the daily transaction log file 135 is made up 
several entries, each entry containing information about a different request [See Fig. 
3,col. 9, lines 3-5]. Thus, it would have been obvious to one of ordinary skill in the art at 
the time the invention was made, to combine the teachings of the cited references, 
because Whitman's teaching of daily transaction log file that received all of the requests 
from web browsers would allow users of Steinkraus and Bond's system to utilize the 
transaction log file as a references or as a look-up table, to select the previous related 
search phrases to be suggested to the user when the user submits a search query 
which has a similar context of search terms. The related query terms suggests to the 
user as a part of the search refinement process. The goal of this process is to produce 
a refined search query that more narrowly specifies the user's intended requests, as 
explained in Whitman [See col. 1, lines 54-58, col. 2, line 51-53, col. 12, lines 49-50]. 
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As per claim 22, Steinkraus teaches 'one or more query log file sources for 
providing the query log file' as a distributed computing environments [See col. 3, lines 
53-55]. 

As per claim 23, Bond teaches 

'wherein the processing module is further configured to: generate unit extensions 
using the final set of units' as ttie completion of syntactic possibilities interpretations are 
passed on to deductive token merging code. The deductive code reviews possible 
interpretations and determines which ones are remaining for the inductive merging 
processes [See col. 5, lines 59-64]; 

'generate unit associations using the final set of units' as a listing of words and 
their associated part of speech is analyzed [See col. 3, lines 32-40] bypassing the 
syntactic possibilities (the final set of units) on to deductive token merging code. The 
deductive code reviews possible interpretations and determines which ones are 
remaining for the inductive merging processes [See col. 5, lines 59-64]; 

'generate unit alternatives using the final set of units' as a syntactic sequence 
that fits the definition of a clause or consecutive tokens [See col. 5, lines 32-35]. It 
should be noted that unit alternatives is described in the specification is a "sequence of 
words" [See Applicant's specification page 13, line 25]; and 

'store the unit extensions, unit associations and unit alternatives to the memory in 
association with the final set of units' the syntactically marked text is output and retained 
in a software object to permit further analysis by the users [See col. 6, lines 13-17]. 
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As per claim 24, Steinkraus and Bond do not teach wherein the received query 
log files include query log files for each day of a week, and wherein the processing 
module is further configured to consolidate the query log files into a single consolidated 
query file consisting of queries for the week. However, Whitman teaches 'wherein the 
received query log files include query log files for each day of a week' as daily results 
file for 7-Feb-98 [See Fig. 6, col. 12, line 8], 'and wherein the processing module is 
further configured to consolidate the query log files into a single consolidated query file 
consisting of queries for the week' as gerierating a new query table for the period of the 
last seven days 1-Feb-98 to 7'Feb-98 [See Fig. 6, col. 12, lines 11-1 9]. Thus, it would 
have been obvious to one of ordinary skill in the art at the time the invention was made 
to combine the teachings of the cited references because merging or combining of 
transaction log files to produce a new search phrase table as disclosed by Whitman's 
system [See col. 10, Iines50-51] is equivalent to the claim "consolidate the query log 
files into a single consolidated query file", would allow users of Steinkraus and Bond to 
eliminate the duplicate phrases [See col. 9, lines 48-53, col. 10, lines 45-46, col. 13, 
lines 30-35] so that the selection process selects related search phrases to be 
suggested to the user is invoked only for single term queries [See col. 12, line 64]. An 
important benefit of this method is that it is highly efficient, allowing the query result 
page to be returned without adding appreciable delay, as suggested in Whitman [See 
col. 12, lines 51-65]. 
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As per claim 25, Whitman teaches 'wherein the processing module consolidates 
by removing duplicates of queries and incrementing a count associated with each 
Individual query each time a duplicate of said individual query is removed, wherein the 
consolidated file includes a list of individual queries and counts associated therewith' as 
the search phrases can be done by sorting alphabetically and removing duplicate 
phrases [See col. 13, lines 31-35], \ fthe search phrase already exists under the key term 
pair, the process increments the search phrase's score [See col. 10, lines 63-64]. 

As per claim 26, Whitman teaches 'wherein the processing module determines a 
frequency of occurrence for each unit using the counts associated with the queries' as a 
score is maintained for each search phrase to reflect how many times the search 
phrase appeared in the transaction log [See col. 1 1 , lines 4-6], 'and wherein the 
processing modules stores the unit frequencies to the memory unit in association with 
the final set of units' as the final values of the scores taken over total number of days 
and are stored within the search phrase table [See col. 7, lines 6-7]. 

As per claim 27, Steinkraus teaches 'wherein the memory unit and processing 
module are implemented in a search server device in a network' as the computing 
distributed environments where tasks are performed by remote processing devices that 
are linked through a communications network [See col. 3, lines 53-55]. 
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Allowable Subject Matter 

The prior art of record fails to teach detecting (a) how often each of the plurality 
of the units appears by itself separate others of the plurality of units, detecting (b) how 
often two or more of the plurality of units appear next to each other across an entire set 
of queries and comparing a result Of (a) with a result of (b). 

Claims 35-37 are objected to as being dependent upon a rejected base claim, 
but would be allowable if rewritten in independent form including all of the limitations of 
the base claim and any intervening claims. 



Response to Arguments 

Applicant's arguments filed 1 1/24/04 have been fully considered but they are not 
persuasive. 

Applicants argued that Steinkrus does not teach convergence condition based on 
an example provided by the applicant's specification, paragraph 0038. 

In response, examiner respectfully disagrees. The examiner submits that an 
example of convergence: "the point beyond which the change in units 
generated/deleted is smaller than some pre-defined threshold value" as three concept 
identifiers, afternormalization, their normalized weights would be 386, 731, and 1000. 
The normalized weights 386, 731 and 1000 are equivalent to the "pre-defined threshold 
value" and their highest weight between the three is 1000 (col. Lines 45-49). Steinkrus 
teaches a method continues until nonvalized weight to 1000 (col. 9, lines 20-22). The 
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instant application claimed limitation "a convergence condition is satisfied" is the point 
when the pre-defined threshold value has reached as until normalized weight reaches 
to 1000. Therefore, the examiner has interpreted the claim limitation in light of the 
specification with the above passages. Thus, Steinkrus does teach the claim limitation 
"a convergence". 



Conclusion 

THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1 .1 36(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the mailing date of this final action. 

Any inquiry concerning this communication or eariier communications from the 
examiner should be directed to DEBBIE M LE whose telephone number is (571) 272- 
4111. The examiner can normally be reached on 8:30-5:00. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, JOHN BREENE can be reached on (571) 272-4107. The fax phone number 
for the organization where this application or proceeding is assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status Information for unpublished applications Is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 
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